Picture for Suha Kwak

Suha Kwak

TextME: Bridging Unseen Modalities Through Text Descriptions

Add code
Feb 03, 2026
Viaarxiv icon

Learned split-spectrum metalens for obstruction-free broadband imaging in the visible

Add code
Jan 27, 2026
Viaarxiv icon

VIRO: Robust and Efficient Neuro-Symbolic Reasoning with Verification for Referring Expression Comprehension

Add code
Jan 19, 2026
Viaarxiv icon

Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection

Add code
Nov 05, 2025
Figure 1 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 2 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 3 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Figure 4 for Part-Aware Bottom-Up Group Reasoning for Fine-Grained Social Interaction Detection
Viaarxiv icon

Improving Sound Source Localization with Joint Slot Attention on Image and Audio

Add code
Apr 21, 2025
Figure 1 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Figure 2 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Figure 3 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Figure 4 for Improving Sound Source Localization with Joint Slot Attention on Image and Audio
Viaarxiv icon

DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation

Add code
Apr 07, 2025
Figure 1 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Figure 2 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Figure 3 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Figure 4 for DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Viaarxiv icon

Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval

Add code
Apr 03, 2025
Viaarxiv icon

GENIUS: A Generative Framework for Universal Multimodal Search

Add code
Mar 25, 2025
Figure 1 for GENIUS: A Generative Framework for Universal Multimodal Search
Figure 2 for GENIUS: A Generative Framework for Universal Multimodal Search
Figure 3 for GENIUS: A Generative Framework for Universal Multimodal Search
Figure 4 for GENIUS: A Generative Framework for Universal Multimodal Search
Viaarxiv icon

Enhancing Cost Efficiency in Active Learning with Candidate Set Query

Add code
Feb 10, 2025
Figure 1 for Enhancing Cost Efficiency in Active Learning with Candidate Set Query
Figure 2 for Enhancing Cost Efficiency in Active Learning with Candidate Set Query
Figure 3 for Enhancing Cost Efficiency in Active Learning with Candidate Set Query
Figure 4 for Enhancing Cost Efficiency in Active Learning with Candidate Set Query
Viaarxiv icon

Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens

Add code
Jan 13, 2025
Figure 1 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 2 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 3 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Figure 4 for Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens
Viaarxiv icon